Application of Breiman's Random Forest to Modeling Structure-Activity Relationships of Pharmaceutical Molecules

Authors

  • Vladimir Svetnik
  • Andy Liaw
  • Christopher Tong
  • Ting Wang
Abstract

Leo Breiman’s Random Forest ensemble learning procedure is applied to the problem of Quantitative Structure-Activity Relationship (QSAR) modeling for pharmaceutical molecules. This entails using a quantitative description of a compound’s molecular structure to predict that compound’s biological activity as measured in an in vitro assay. Without any parameter tuning, the performance of Random Forest with default settings on six publicly available data sets is already as good as or better than that of three other prominent QSAR methods: Decision Tree, Partial Least Squares, and Support Vector Machine. In addition to reliable prediction accuracy, Random Forest provides variable importance measures, which can be used in a variable reduction wrapper algorithm. Comparisons among several such wrappers, and between Random Forest and Bagging, are presented.
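
The variable reduction wrapper mentioned in the abstract can be illustrated with a minimal Python sketch. The abstract does not specify the authors' wrapper, so everything here is an assumption made for illustration: the halving schedule (`keep_fraction`), the function names, the synthetic data, and above all the importance score, which is the absolute Pearson correlation used as a dependency-free stand-in for Random Forest variable importance. In practice `rank_fn` would be backed by a fitted Random Forest, and each retained subset would be scored, for example by cross-validation.

```python
import random
from statistics import mean


def abs_correlation(xs, ys):
    """Stand-in importance (NOT Random Forest importance): |Pearson r|."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return abs(cov) / (vx * vy) ** 0.5


def rank_variables(X, y, columns):
    """Return the given column indices sorted from most to least important."""
    scores = {j: abs_correlation([row[j] for row in X], y) for j in columns}
    return sorted(columns, key=lambda j: scores[j], reverse=True)


def reduction_path(X, y, rank_fn=rank_variables, keep_fraction=0.5, min_vars=1):
    """Re-rank the remaining variables and drop the least important, repeatedly.

    Returns the list of retained column subsets, from all variables down to
    `min_vars`. The halving schedule is a hypothetical choice for this sketch.
    """
    columns = list(range(len(X[0])))
    path = [columns]
    while len(columns) > min_vars:
        ranked = rank_fn(X, y, columns)
        n_keep = max(min_vars, int(len(ranked) * keep_fraction))
        n_keep = min(n_keep, len(ranked) - 1)  # always drop at least one
        columns = ranked[:n_keep]
        path.append(columns)
    return path


# Synthetic "descriptors": only columns 0 and 1 drive the simulated activity.
random.seed(0)
X = [[random.gauss(0, 1) for _ in range(8)] for _ in range(200)]
y = [3 * row[0] - 2 * row[1] + random.gauss(0, 0.1) for row in X]

path = reduction_path(X, y)
for subset in path:
    print(len(subset), sorted(subset))
```

Re-ranking at every step, rather than ranking once up front, is one design choice among several; a wrapper that ranks only once with all variables present is an equally plausible variant.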

Similar resources

Application of ensemble learning techniques to model the atmospheric concentration of SO2

In view of pollution prediction modeling, the study adopts homogeneous (random forest, bagging, and additive regression) and heterogeneous (voting) ensemble classifiers to predict the atmospheric concentration of sulphur dioxide. For model validation, results were compared against widely known single base classifiers such as support vector machine, multilayer perceptron, linear regression and re...

Risks assessment of forest project implementation in spatial density changes of forest under canopy vegetation using artificial neural network modeling approach

Nowadays, environmental risk assessment has been defined as one of the effective in environmental planning and policy making. Considering the position and structure of vegetation on the forest floor, the main role of forest under ca...

Strategy for research of new pharmacologically active molecules from plants for the treatment of pathologies

Herbal medicine, botanical medicine, phytotherapy, alternative medicine, and complementary medicine are terms used to describe the science of using plant-based materials to treat specific symptoms or diseases. People have a strong belief that natural remedies are perfectly safe. Because we have strong ties to traditional culture, we use herbs and spices on a daily basis. Plants are an abundant natura...

Selective COX-2 Inhibitors: A Review of Their Structure-Activity Relationships

Non-steroidal anti-inflammatory drugs (NSAIDs) are competitive inhibitors of cyclooxygenase (COX), the enzyme that mediates the bioconversion of arachidonic acid to inflammatory prostaglandins (PGs). Their use is associated with side effects such as gastrointestinal and renal toxicity. The therapeutic anti-inflammatory action of NSAIDs is produced by the inhibition of COX-2, while the ...

Publication date: 2004